NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

RaGNNarok: A Light-Weight Graph Neural Network for Enhancing Radar Point Clouds on Unmanned Ground Vehicles

Hunt, David; Luo, Shaocheng; Hallyburton, Spencer; Nillongo, Shafii; Li, Yi; Chen, Tingjun; Pajic, Miroslav (October 2025, 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Free, publicly-accessible full text available October 19, 2026
Variational Adversarial Training Towards Policies with Improved Robustness

Dong, Juncheng; Hsu, Hao-Lun; Gao, Qitong; Tarokh, Vahid; Pajic, Miroslav (May 2025, The 28th International Conference on Artificial Intelligence and Statistics)

Free, publicly-accessible full text available May 3, 2026
Efficient Neuro-Symbolic Policy using In-Memory Computing

Molom-Ochir, Tergel; Saxena, Naman; Kim, Jiwoo; Chen, Yiran; Wang, Zhangyang; Pajic, Miroslav; Li, Hai (May 2025, International Conference on Neuro-symbolic Systems (NeuS))

Free, publicly-accessible full text available May 28, 2026
Attacks on Perception-Based Control Systems: Modeling and Fundamental Limits

https://doi.org/10.1109/TAC.2024.3401022

Khazraei, Amir; Pfister, Henry D; Pajic, Miroslav (November 2024, IEEE Transactions on Automatic Control)

Full Text Available
Off-Policy Evaluation for Human Feedback

Gao, Qitong; Gao, Ge; Dong, Juncheng; Tarokh, Vahid; Chi, Min; Pajic, Miroslav (December 2024, The Thirty-Eighth Annual Conference on Neural Information Processing Systems)

Full Text Available
Learning Optimal Strategies for Temporal Tasks in Stochastic Games

https://doi.org/10.1109/TAC.2024.3390848

Bozkurt, Alper Kamil; Wang, Yu; Zavlanos, Michael M; Pajic, Miroslav (November 2024, IEEE Transactions on Automatic Control)

Full Text Available
RadCloud: Real-Time High-Resolution Point Cloud Generation Using Low-Cost mmWave Radars for Aerial and Ground Vehicles

https://doi.org/10.1145/3636534.3698849

Hunt, David; Luo, Shaocheng; Khazraei, Amir; Zhang, Xiao; Hallyburton, Spencer; Chen, Tingjun; Pajic, Miroslav (December 2024, ACM)

Full Text Available
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning

Hsu, Hao-Lun; Wang, Weixin; Pajic, Miroslav; Xu, Pan (September 2024, Advances in Neural Information Processing Systems)

We present the first study on provably efficient randomized exploration in cooperative multi-agent reinforcement learning (MARL). We propose a unified algorithm framework for randomized exploration in parallel Markov Decision Processes (MDPs), and two Thompson Sampling (TS)-type algorithms, CoopTS-PHE and CoopTS-LMC, incorporating the perturbed-history exploration (PHE) strategy and the Langevin Monte Carlo exploration (LMC) strategy, respectively, which are flexible in design and easy to implement in practice. For a special class of parallel MDPs where the transition is (approximately) linear, we theoretically prove that both CoopTS-PHE and CoopTS-LMC achieve a $$\widetilde{\mathcal{O}}(d^{3/2}H^2\sqrt{MK})$$ regret bound with communication complexity $$\widetilde{\mathcal{O}}(dHM^2)$$, where $$d$$ is the feature dimension, $$H$$ is the horizon length, $$M$$ is the number of agents, and $$K$$ is the number of episodes. This is the first theoretical result for randomized exploration in cooperative MARL. We evaluate our proposed method on multiple parallel RL environments, including a deep exploration problem (i.e., $$N$$-chain), a video game, and a real-world problem in energy systems. Our experimental results support that our framework can achieve better performance, even under conditions of misspecified transition models. Additionally, we establish a connection between our unified framework and the practical application of federated learning.
more » « less
Full Text Available
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning

Hsu, Hao-Lun; Wang, Weixin; Pajic, Miroslav; Xu, Pan (September 2024, Advances in Neural Information Processing Systems)

We present the first study on provably efficient randomized exploration in cooperative multi-agent reinforcement learning (MARL). We propose a unified algorithm framework for randomized exploration in parallel Markov Decision Processes (MDPs), and two Thompson Sampling (TS)-type algorithms, CoopTS-PHE and CoopTS-LMC, incorporating the perturbed-history exploration (PHE) strategy and the Langevin Monte Carlo exploration (LMC) strategy, respectively, which are flexible in design and easy to implement in practice. For a special class of parallel MDPs where the transition is (approximately) linear, we theoretically prove that both CoopTS-PHE and CoopTS-LMC achieve a $$\widetilde{\mathcal{O}}(d^{3/2}H^2\sqrt{MK})$$ regret bound with communication complexity $$\widetilde{\mathcal{O}}(dHM^2)$$, where $$d$$ is the feature dimension, $$H$$ is the horizon length, $$M$$ is the number of agents, and $$K$$ is the number of episodes. This is the first theoretical result for randomized exploration in cooperative MARL. We evaluate our proposed method on multiple parallel RL environments, including a deep exploration problem (i.e., $$N$$-chain), a video game, and a real-world problem in energy systems. Our experimental results support that our framework can achieve better performance, even under conditions of misspecified transition models. Additionally, we establish a connection between our unified framework and the practical application of federated learning.
more » « less
Full Text Available
Robust exploration with adversary via Langevin Monte Carlo

Hsu, Hao-Lun; Pajic, Miroslav (July 2024, The 6th Annual Learning for Dynamics & Control Conference)

Full Text Available

« Prev Next »

Search for: All records